Instance Selection: A Bayesian Decision Theory Perspective
نویسندگان
چکیده
In this paper, we consider the problem of lacking theoretical foundation and low execution efficiency instance selection methods based on k-nearest neighbour rule when processing large-scale data. We point out that core idea these can be explained from perspective Bayesian decision theory, is, to find which instances are reducible, irreducible, deleterious. Then, percolation establish relationship between three types local homogeneous cluster (i.e., a set with same labels). Finally, propose method an accelerated k-means algorithm construct clusters remove superfluous instances. The performance our is studied extensive synthetic benchmark data sets. Our proposed handle more effectively than state-of-the-art methods. All code results available at https://github.com/CQQXY161120/Instance-Selection.
منابع مشابه
Optimal instance selection for improved decision tree
Instance selection plays an important role in improving scalability of data mining algorithms, but it can also be used to improve the quality of the data mining results. In this dissertation we present a new optimization-based approach for instance selection that uses a genetic algorithm (GA) to select a subset of instances to produce a simpler decision tree with acceptable accuracy. The result...
متن کاملA Unied Bayesian Decision Theory
This paper provides new foundations for Bayesian Decision Theory based on a representation theorem for preferences de ned on a set of prospects containing both factual and conditional possibilities. This use of a rich set of prospects not only provides a framework within which the main theoretical claims of Savage, Ramsey, Je¤rey and others can be stated and compared, but also allows for the po...
متن کاملInstance selection for simplified decision trees through the generation and selection of instance candidate subsets
vi CHAPTER
متن کاملA Minimal Extension of Bayesian Decision Theory
Savage denied that Bayesian decision theory applies in large worlds. This paper proposes a minimal extension of Bayesian decision theory to a large-world context that evaluates an event E by assigning it a number π(E) that reduces to an orthodox probability for a class of measurable events. The Hurwicz criterion evaluates π(E) as a weighted arithmetic mean of its upper and lower probabilities, ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Proceedings of the ... AAAI Conference on Artificial Intelligence
سال: 2022
ISSN: ['2159-5399', '2374-3468']
DOI: https://doi.org/10.1609/aaai.v36i6.20578